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(57) ABSTRACT 

A video encoding method and apparatus for adapting a video 
input to a bandwidth of a transmission channel of a network 
that includes determining the number N enhancement layer 
bitstreams capable of being adapted to the bandwidth of the 
transmission channel of a network. Abase layer bitstream is 
encoded from the video input wherein a plurality of 
enhancement layer bitstreams are encoded from the video 
input. The enhancement layer bit-streams are based on the 
base layer bitstream, wherein the plurality of enhancement 
layer bitstreams complements the base layer bitstream and 
the base layer bitstream and N enhancement layer bitstreams 
to are transmitted to the network. 
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SCALABLE VIDEO CODING METHOD AND 
APPARATUS 

This application claims the benefit of and priority from 
Provisional Application No. 60/093,860, filed in the United 
States Patent and Trademark Office Jul. 7, 1998, entitled "A 
Scalable Video Coding Method" by inventor Weiping Li, 
which is hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a method and apparatus 
for the scaling of data signals the bandwidth of the trans- 
mission channel; and more particularly to a scalable video 
method and apparatus for coding video such that the 
received video is adapted to the bandwidth of the transmis- 
sion channel. 

2. Description of Related Art 

Signal compression in the video arena has long been 
employed to increase the bandwidth of either the generating, 
transmitting, or receiving device. MPEG — an acronym for 
Moving Picture Experts Group — refers to the family of 
digital video compression standards and file formats devel- 
oped by the group. For instance, the MPEG-1 video 
sequence is an ordered stream of bits, with special bit 
patterns marking the beginning and ending of a logical 
section. 

MPEG achieves high compression rate by storing only the 
changes from one frame to another, instead of each entire 
frame. The video information is then encoded using a 
technique called DCT (Discrete Cosine Transform) which is 
a technique for representing a waveform data as a weighted 
sum of cosines. MPEG use a type of lossy compression 
wherein some data is removed. But the diminishment of data 
is generally imperceptible to the human eye. It should be 
noted that the DCT itself does not lose data; rather, data 
compression technologies that rely on DCT approximate 
some of the coefficients to reduce the amount of data. 

The basic idea behind MPEG video compression is to 
remove spatial redundancy within a video frame and tem- 
poral redundancy between video frames. The DCT-based 
(Discrete Cosine Transform) compression is used to reduce 
spatial redundancy and motion compensation is used to 
exploit temporal redundancy. The images in a video stream 
usually do not change much within small time intervals. 
Thus, the idea of motion-compensation is to encode a video 
frame based on other video frames temporally close to it. 

A video stream is a sequence of video frames, each frame 
being a still image. A video player displays one frame after 
another, usually at a rate close to 30 frames per second. 
Macrblocks are formed, each macroblock consists of four 
8x8 luminance blocks and two 8x8 chrominance blocks. 
Macroblocks are the units for motion-compensated 
compression, wherein blocks are basic unit used for DCT 
compression. Frames can be encoded in three types: intra- 
frames (I-frames), forward predicted frames (P-frames), and 
bi-directional predicted frames (B-frames). 

An I-frame is encoded as a single image, with no refer- 
ence to any past or future frames. Each 8x8 block is encoded 
independently, except that the coefficient in the upper left 
corner of the block, called the DC coefficient, is encoded 
relative to the DC coefficient of the previous block. The 
block is first transformed from the spatial domain into a 
frequency domain using the DCT (Discrete Cosine 
Transform), which separates the signal into independent 
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frequency bands. Most frequency information is in the upper 
left corner of the resulting 8x8 block. After the DCT 
coefficients are produced the data is quantized, i.e. divided 
or separated. Quantization can be thought of as ignoring 

5 lower-order bits and is the only lossy part of the whole 
compression process other than subsampling. 

The resulting data is then run-length encoded in a zig-zag 
ordering to optimize compression. The zig-zag ordering 
produces longer runs of 0's by taking advantage of the fact 

10 that there should be little high-frequency information (more 
0's as one zig-zags from the upper left corner towards the 
lower right corner of the 8x8 block). 

A P-frame is encoded relative to the past reference frame. 
A reference frame is a P- or I-frame. The past reference 

15 frame is the closest preceding reference frame. A 
P-macroblock is encoded as a 16x16 area of the past 
reference frame, plus an error term. 

To specify the 16x16 area of the reference frame, a motion 
vector is included. A motion vector (0, 0) means that the 

20 16x16 area is in the same position as the macroblock we are 
encoding. Other motion vectors are generated are relative to 
that position. Motion vectors may include half-pixel values, 
in which case pixels are averaged. The error term is encoded 
using the DCT, quantization, and run-length encoding. A 

25 macroblock may also be skipped which is equivalent to a (0, 
0) vector and an all- zero error term. 

A B-frame is encoded relative to the past reference frame, 
the future reference frame, or both frames. 

30 A pictorial view of the above processes and techniques in 
application are depicted in prior art FIG. 15, which illus- 
trates the decoding process for a SNR scalability. Scalable 
video coding means coding video in such a way that the 
quality of a received video is adapted to the bandwidth of the 

3S transmission channel. Such a coding technique is very 
desirable for transmitting video over a network with a 
time-varying bandwidth. 

SNR scalability defines a mechanism to refine the DCT 
coefficients encoded in another (lower) layer of a scalable 

4Q hierarchy. As illustrated in prior art FIG. 15, data from two 
bitstreams is combined after the inverse quantization pro- 
cesses by adding the DCT coefficients, Until the dat is 
combined, the decoding processes of the two layers are 
independent of each other. 

45 The lower layer (base layer) is derived from the first 
bitstream and can itself be either non-scalable, or require the 
spatial or temporal scalability decoding process, and hence 
the decoding of additional bitstream, to be applied. The 
enhancement layer, derived from the second bitstream, con- 

50 tains mainly coded DCT coefficients and a small overhead. 
In the current MEPG-2 video coding standard, there is an 
SNR scalability extension that allows two levels of scalabil- 
ity. MPEG achieves high compression rate by storing only 
the changes from one frame to another, instead of each entire 

55 frame. There are at least two disadvantages of employing the 
MEPG-2 standard for encoding video data. One disadvan- 
tage is that the scalability granularity is not fine enough, 
because the MEPG-2 process is an all or none method. 
Either the receiving device can receive all of the data from 

60 the base layer and the enhancement layer or only the data 
from the base layer bitstream. Therefore, the granularity is 
not scalable. In a network environment, more than two 
levels of scalability are usually needed. 
Another disadvantage is that the enhancement layer cod- 

65 ing in MEPG-2 is not efficient. Too many bits are needed in 
the enhancement layer in order to have a noticeable increase 
in video quality. 
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The present invention overcomes these disadvantages and received enhancement layer bitstreams complements the 

others by providing, among other advantages, an efficient base layer bitstream. Then reconstructing the base layer 

scalable video coding method with increased granularity. bitstream and N enhancement layer bitstreams. 

CITM „ ADV ™ ~„t: Ivnn:NTTinM According to still another aspect of the present invention, 

SUMMARY OF THE INVENTION 5 a yideo decoding method fof adapting a y{dcQ ^ tQ a 

The present invention can be characterized as a scalable bandwidth of a receiving apparatus, the method includes 

video coding means and a system for encoding video data, demultiplexing a base layer bitstream and at least one of a 

such that quality of the final image is gradually improved as plurality of enhancement layer bitstreams received from a 

more bits are received. The improved quality and scalability network, decoding the base layer bitstream, decoding at least 

are achieved by a method wherein an enhancement layer is 10 one °f me plurality of enhancement layer bitstreams based 

subdivided into layers or levels of bitstream layers. Each on generated base layer bitstream, wherein the at least one 

bitstream layer is capable of carrying information comple- of the plurality of enhancement layer bitstreams enhances 

mentary to the base layer information, in that as each of the the base layer bitstream. Then reconstructing a video output, 

enhancement layer bitstreams are added to the correspond- According to a further aspect of the present invention, a 

ing base layer bitstreams the quality of the resulting images 15 video encoding method for encoding enhancement layers 

are improved. based on a base layer bitstream encoded from a video input, 

The number N of enhancement layers is determined or the video encoding method includes, taking a difference 

limited by the network that provides the transmission chan- between an original DCT coefficient and a reference point 

nel to the destination point. While the base layer bitstream and dividing the difference between the original DCT coef- 

is always transmitted to the destination point, the same is not ficient and the reference point into N bit-planes, 

necessarily true for the enhancement layers. Each layer is According to a still further aspect of the present invention, 

given a priority coding and transmission is effectuated a method of coding motion vectors of a plurality of 

according to the priority coding. In the event that all of the macroblocks, includes determining an average motion vec- 

enhancement layers cannot be transmitted the lower priority 2$ tor from N motion vectors for N macroblocks, utilizing the 

coded layers will be omitted. The omission of one or more determined average motion vector as the motion vector for 

enhancement layers may be due to a multitude of reasons. the N marcoblocks, and encoding 1/N motion vectors in a 

For instance, the server which provides the transmission base layer bitstream. 

channel to the destination point may be experiencing large Additional features and advantages of the invention will 

demand on its resources from other users, in order to try and 30 be set forth in the description which follows, and in part will 

accommodate all of its users the server will prioritize the be apparent from the description, or may be learned by 

data and only transmit the higher priority coded packets of practice of the invention. The aspects and other advantages 

information. The transmission channel may be the limiting of the invention will be realized and attained by the structure 

factor because of the bandwidth of the channel, i.e. Internet particularly pointed out in the written description and claims 

access port, Ethernet protocol, LAN, WAN, twisted pair 35 hereof as well as the appended drawings, 

cable, co-axial cable, etc. or the destination device itself, i.e. u ^ tQ be mat ^ me foregoing general 

modem, absence of an enhanced video card, etc. may not be description and the following detailed description are exem- 

able to receive the additional bandwidth made available to it. plary arjd explanatory and are intended to provide further 

In these instances only M number (M is an integer number- explanation of the invention as claimed. 

0, 1, 2, . . . ) of enhancement layers may be received, wherein ^ 

N number (N is an integer number-0, 1, 2, . . . ) of BRIEF DESCRIPTION OF THE DRAWINGS 
enhancement layers were generated at the encoding stage, 

MSjsf. The accompanying drawings, which are included to pro- 

To achieve these and other advantages and in accordance vide » further understanding of the invention and are incor- 

with the purpose of the present invention, as embodied and 4S mandcon^utute apartof thisspec.ficat.on.iUustrate 

broadly described, the scalable video method and apparatus ^bodnnents of the mvention and together w,th the descnp- 

according to one aspect of the invention includes a video jj^j*™ 10 CXpliUn ** P nnci P les of ,he ^nUon. In the 

encoding method for adapting a video input to a bandwidth ^ ' 

of a transmission channel of a network, the method includes FIG - 1 illustrates a flow diagram of the scalable video 

determining the number N of enhancement layer bitstreams 50 encodin g method of the present invention; 

capable of being adapted to the bandwidth of the transmis- FIG. 2A illustrates conventional probability distribution 

sion channel of the network. Encoding a base layer bitstream of DCT coefficient values; 

from the video input is then performed and encoding N FIG. 2B illustrates conventional probability distribution 

number of enhancement layer bitstreams from the video of DCT coefficient residues; 

input based on the base layer bitstream, wherein the plurality 55 FIG. 3A illustrates the probability distribution of DCT 

of enhancement layer bitstreams complements the base layer coeffici ent values of the present invention; 

bitstream. The base layer bitstream and the N enhancement ni/ ^ ... , t . . .... ., . c 

layer bitstreams are then provided to the network. F ]P: 3 f ^""l P robablht y Unbutton of DCT 

\ , , coefficient residues of the present mvention; 

According to another aspect of the present mvention, a nir , c ^ . ^ ... t t . _ , 

video decoding method for adapting a video input to a 60 J lGS ' 3C f *£i D grates a method for taking a 

bandwidth of a transmission channel of a network includes, dlfference of a coefficient of the present mvention; 

determining number M of enhancement layer bitstreams of FIG - 4 iUustrates a flow diagram for difference coding, 

said video input capable of being received from said trans- FIG. 5 illustrates a flow diagram for finding the maximum 

mission channel of said network. Decoding a base layer number of bit-planes in the DCT differences of a frame of 

bitstream from received video input and decoding M number 65 me present invention; 

of enhancement layer bitstreams from the received video FIG. 6 illustrates a flow diagram for generating (RUN, 

input based on the base layer bitstream, wherein the M EOP) Symbols of the present invention; 
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FIG. 7 Illustrates a flow diagram for encoding enhance- In a second method the server 50 knows the transmission 

ment layers of the present invention; channel 60 condition, i.e. congestion and other physical 

FIG. 8 illustrates a flow diagram for encoding (RUN, constraints, and selectively sends the bitstrcams to the 

EOP) symbols and sign_enh values of one DCT block of channel according to the priority identifiers. In either case, 

one bit-olane* 5 destination point 100 receives the bitstream for the base 

^ ' . . layer aod M bitstreams for the enhancement layer, where 

FIG. 9 illustrates a now diagram for encoding a sign enh M— N 

value of the present invention; ^ e bitstreams M arc ^ to the base layer 90 and 

FIG. 10 illustrates a flow diagram for adding enhance- enhancement layer 80 decoders after being demultiplexed by 

ment difference to a DCT coefficient of the present inven- demultiplexer 70. The decoded enhancement information 

tion; from the enhancement layer decoder is passed to the base 

FIG. 11 illustrates a flow diagram for converting enhance- layer decoder to composite the reconstructed video output 

ment difference to a DCT coefficient of the present inven- 1 00. The decoding of the multiplexed bitstreams are accom- 

ti on . plished pursuant to the methods and algorithms depicted in 

n G . 12 illustrates a flow diagram for decoding enhance- 15 fl ™ di <f a™ HOO-1400 of FIGS .11-14, respectively. 

ment layers of the present invention; , n ? ba f encoder ^^cri*™ ™ff£SU°£ 

mn ii -11 ♦ ♦ fl ,r c a a- /oirxr forming logic pursuant to the MPEG-1, MPEG-2, or 

^ \ U ^ a ! CS " ho \^ ram *° r de ^? 1 (R t UN l MPEG-4 (Version-1) standards that are hereby incorporated 

EOP) symbols and sign_enh values of one DCT block of by re f ererjce into this disclosure, 

one bit-plane; Taking Residue with Probability Distribution Preserved 

FIG. 14 illustrates a flow diagram for decoding a sign_ 20 A detailed description of the probability distribution resi- 

enh value; and due will now be made with reference to FIGS. 2A-3B. 

FIG. 15 illustrates a prior a conventional SNR scalability In the current MEPG-2 signal-to-noise ratio (SNR) seal- 
flow diagram. ability extension, a residue or difference is taken between the 

original DCT coefficient and the quantized DCT coefficient. 

DETAILED DESCRIPTION OF THE 25 FIG. 2A illustrates the distribution of a residual signal as a 

PREFERRED EMBODIMENTS DCT coefficient. In taking the residue small values have 

^ . .„ , higher probabilities and large values have smaller probabili- 

Reference will now be made m detail to the preferred ties ^ e intervals ^ the horizonta i axis represent quan- 

embodiments of the present invention, examples of which tization bins m dot m mc ceatcr 0 f each interval represents 

are illustrated in the accompanying drawings. 3Q me quantized DCT coefficient. Taking the residue between 

FIG. 1 illustrates the scalable video diagram 10 of an the original and the quantized DCT coefficient is equivalent 

embodiment of the present invention. The original video to moving the origin to the quantization point, 

input 20 is encoded by the base layer encoder 30 in accor- Therefore, the probability distribution of the residue 

dance with the method of represent by flow diagram 400 of becomes that as shown in FIG. 2B. The residue from the 

FIG. 4. A DCT coefficient OC and its corresponding base positive side of FIG. 2A has a higher probability of being 

layer quantized DCT coefficient QC arc generated and a negative than positive and the residue taken from the nega- 

difference determined pursuant to steps 420 and 430 of FIG. tive side of & c FIG * 2A has a m *g her probability of being 

4. The difference information from the base layer encoder 30 positive than negative. The result is that the probability 

is passed to the enhancement layer encoder 40 that encodes distribution of the residue becomes almost uniform. Thus 

the enhancement information. M m ^ <? dm S residu t ? T re f^ 11 ' , A .„ 

_ i- . . . i 40 A vastly superior method is to generate a difference 

Hie encoding of the enhancement layer encoder is per- betwcCQ ^ original ^ the lowcr points of the 

formed pursuant to methods 50O-900 as depicted in FIGS. quantized interval as shown in FIG. 3A and FIG. 3B. In this 

5-10, respectively and will be briefly described. The bit- method, the residue is taken from the positive side of FIG. 

stream from the base layer encoder 30 and the N bitstreams 2 A remains positive and the residue from the negative side 

from the enhancement layer encoder 40 are capable of being 45 Q f pjQ. 2A remains negative. Taking the residue is equiva- 

sent to the transmission channel 60 by at least two methods. i ent to mov ing the origin to the reference point as illustrated 

In the first method all bitstreams are multiplexed together in FIG. 3A Thus, the probability of the residue becomes as 
by multiplexor 50 with different priority identifiers, e.g., the shown in FIG. 3B. This method preserves the shape of the 
base layer bitstream is guaranteed, enhancement bitstream original non-uniform distribution. Although the dynamic 
layer 1 provided by enhancement layer encoder 40 is given so range of the residue taken in such a manner seems to be 
a higher priority than enhancement bitstream layer 2. The twice of that depicted in FIG. 2B, their is no longer a need 
prioritization is continued until all N (wherein N is an to code the sign, i.e. - or +, of the residue. The sign of the 
integer from 0, 1, 2, . . . ) of the bitstreams layers are residue is encoded in the base layer bitstream corresponding 
prioritized. Logic in the encoding layers 30 or 40 in nego- the enhancement layer, therefore this redundancy is elimi- 
nation with the network and intermediated devices deter- ss nated and bits representing the sign are thus saved, 
mine the number N of bitstream layers to be generated. Therefore, there is only a need to code the magnitude that 

The number of bitstream layers generated is a function of still has a non-uniform distribution, 
the total possible bandwidth of the transmission channel 60, Bit Plane Coding of Residual DCT Coefficients 
i.e. Ethernet, LAN, or WAN connections (this list is not After taking residues of all the DCT coefficients in an 8x8 
intended to exhaustive but only representation of potential 60 block, bit plane coding is used to code the residue. In 
limiting devices and/or equipment), and the network and bit-plane coding method the bit-plane coding method con- 
other intermediate devices. The number of bitstream layers siders each residual DCT coefficient as a binary number of 
M (wherein M is an integer and M^N) reaching the desti- several bits instead of as a decimal integer of a certain value 
nation point 100 can be further limited by not just the as in the run-level coding method. The bit-plane coding 
physical constraints of the intermediate devices but the 65 method in the present invention only replaces runlevel 
congestion on the network, thereby necessitating the drop- coding part. Therefore, all the other syntax elements remain 
ping of bitstream layers according to their priority. the same. 
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An example of and description of the bit-plane coding Converting the bits of each bit-plane into (RUN, EOP) 

method will now be made, wherein 64 residual DCT coef- symbols results in the following: 
ficients for an Inter-block and 63 residual DCT coefficients $ t ^ (MSB-plane) 

for an Intra-block (excluding the Intra-DC component that is ' ^ , c . wca , . 

ii- i ix * (2, 1) (Second MSB-plane) 

coded using a separate method) are utilized for the example. 5 

Hie 64 (or 63) residual DCT coefficients are ordered into a & °>> ^ < 2 *°>- ^ <°> °>> <*■ « MSB-plane) 

one-dimensional array and at least one of the residual ( 5 > 0), (8, l) (Fourth MSB-plane or LSB-piane) 

coefficients is non-zero. The bit-plane coding method then Therefore, there are 10 symbols to be coded using the 

performs the following steps. (RUN, EOP) VLC tables. Based on their locations in the 

The maximum value of all the residual DCT coefficients 10 bit-planes, different VLC tables are used for the coding. The 

in a frame is determined and the minimum number of bits, enhancement bitstream using all four bitplanes looks as 

N, needed to represent the maximum value in the binary follows: 
format is also determined. N is the number of bitplanes code leading-all-zero(2) 

layers for this frame and is coded in the frame header. code ms tj(o, i) 

Within each 8x8 block is represent every one of the 64 (or 15 oode msb .^ 2 1) 

63) residual DCT coefficients with N bits in the binary 

format and there is formed N bit-planes or layers or levefc. ^^^^^^^t^ 
A bit-plane is denned as an array of 64 (or 63) bits, taken one codc_msb-3(8, 3). 

from each residual DCT coefficient at the same significant ^ Iq m alternative embodiment, several enhancement bit- 
position, streams may be formed from the four bit-planes, in this 
^ The most significant bit-plane is determined with at least example from the respective sets comprising one or more of 
^one non-zero bit and then the number of all -zero bit-planes the four bit-planes, 
"between the most significant bit-plane determined and the Motion Vector Sharing 

Nth one is coded Then starting from the most significant bit In this alternative embodiment of the present invention 

plane (MSB plane), 2-D symbols are formed of two com- 25 motion vector sharing is capable of being utilized when the 

ponents: (a) number of consecutive O's before a I (RUN), base lavcr bitstream exceeds a predetermined size or more 

(b) whether there are any l's left on this bit plane, i.e. levels of scalability are needed for the enhancement layer. 

End-Of-Plane (EOP). If a bit-plane after the MSB plane B Y lowering the number of bits required for coding the 

contains all O's, a special symbol ALL-ZERO is formed to motion vectors in me base laver me bandwidth requirements 

represent an all-zero bit-plane. Note that the MSB plane 30 of ^ ba j? la >™ bi»eam is reduced In base layer coding, 

does not have the all-zero case because any all-zero bit- a macroblock (16x16 pixels for the lummance component 

planes before the MSB plane have been coded in the and W for each ^^T^ wm P onents ) ° f the 

r . v current frame is compared with the previous frame within a 

previous s ps. search range. The closest match in the previous frame is used 

Four 2-D VLC tables are used, wherein the table VT-C- 35 as a prediction of the current macroblock. The relative 

Table-0 corresponds to the MSB plane; table VLC-Table-I displacement of the prediction to the current macroblock, in 

corresponds to the second MSB plane; table VLC-Table-2 the horizontal and vertical directions, is called a motion 

corresponds to the third MSB plane; and table VLC-Table-3 vector 

corresponds to the fourth MSB and all the lower bit planes. The difference between the current macroblock and it's 

For the ESCAPE cases, RUN is coded with 6 bits, EOP is ^ prediction is coded using the DCT coding. In order for the 

coded with 1 bit. Escape coding is a method to code very decoder to reconstruct the current macroblock, the motion 

small probability events which are not in the coding tables vector has to be coded in the bitstream. Since there is a fixed 

individually. number of bits for coding a frame, the more bits spent on 

A i r *u u *ii c ii coding the motion vectors results in fewer bits for coding the 

An example of the above process will now follow. For 6 . , f - t . , . J? . 

M1 A 4 . r .„ r ..... *j i i motion compensated differences. Therefore, it is desirable to 

ilhistration purposes, we will assume that the residual values 45 lower ^ n ^ ba of bits for codi ^ motioQ vcctors and 

after the zigzag ordering are given as follows and N«6: The leave more Wts fof ^ ^ % mGTCtsCGS ^tween the 

following representation is thereby produced. currem macroblock its prediction. 

For each set of 2x2 motion vectors, the average motion 

10, 0, 6, 0, 0, 3, 0, 2, % 0, 0, 2, 0, 0, l, 0, ... 0, 0 vector can determined and used for the four macroblocks. 

50 In order to not change the syntax of the base layer coding, 

The maximum value in this block is found to be 10 and four marcoblocks are forced to have the identical motion 

the minimum number of bits to represent 10 in the binary vectors. Since only one out four motion vectors is coded in 

format (1010) is 4. Therefore, two all-zero bit-planes before the bitstream, the amount of bits spent on motion vector 

the MSB plane are coded with a code for the value 2 and the coding is reduced, therefore, there are more bits available for 

remaining 4 bit-planes are coded using the (RUN, EOP) 55 coding the differences. The cost for pursuing such a method 

codes. Writing every value in the binary format using 4 bits, is that the four macroblocks, which share the same motion 

the 4 bit-planes are formed as follows: vector may, not get the best matched prediction individually 

and the motion compensated difference may have a larger 

1, 0, o, 0, 0, 0, 0, o, 0, 0, 0, 0, 0, 0, 0,0, 0 (MSB-pbne) dynamic range, thus necessitating more bits to code the 

60 motion vector. 

0, o, i f 0, a 0, o, 0, 0, 0, 0, 0, o, 0, 0, OA 0 (Second MSB- p or a given fixed bitrate, the savings from coding one out 
plane ^ of four motion vectors may not compensate the increased 

,«,« rt -.n,,on, rtrtnrtrt « . v number of bits required to code the difference with a larger 

1, 0, 1, 0, 0, 3, 0, 1, 1, 0, 0, 1, 0, 0, 0, OA 0 (Third MSB-plane) , 7? - . * 

¥ dynamic range. However, for a time varying bitrate, a wider 

0, 0, o, 0, 0, 1, 0, 0, 0, 0, 0, 0, o, 0, 1, OA 0 (Fourth MSB-plane 65 dvnamic ran S e for the enhancement layer provides more 

or LSB-plane) flexibility to achieve the best possible usage of the available 

bandwidth. 
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Coding Sign Bits 

In an alternative embodiment of the present invention, if 
the base layer quantized DCT coefficient is non-zero, the 
corresponding enhancement layer difference will have the 
same sign as the base layer quantized DCT. Therefore, there 5 
is no need to code the sign bit in the enhancement layer. 

Conversely, if the base layer quantized DCT coefficient is 
zero and corresponding enhancement layer difference is 
non-zero, a sign bit is placed into enhancement layer bit- 
stream immediately after the MSB of the difference. An 10 
example of the above method will now follow. 
Difference of a DCT block after ordering 

10, 0, 6, 0, 0, 3, 0, 2, 2, 0, 0, 2, 0, 0, 1, 0, ... 0, 0 

Sign indications of the DCT block after ordering 

3, 3, 3, 3, 2, 0, 3, 3, 1, 2, 2, 0, 3, 3, 1, 2, ... 2, 3 

0: base layer quantized DCT coefficient^ and difference^ 
1: base layer quantized DCT coefficients and difference<0 20 
2: base layer quantized DCT coefficients and difference=0 
3: base layer quantized DCT coefficient! =0 

In this example, the sign bits associated with values 10, 6, 
2 don't need to be coded and the sign bits associated with 3, 
2, 2, 1 are coded in the following way: 25 

Codc(All Zero) 
code (All Zero) 

code(0,l) ^ U 
code(2,l) 

code(0 J 0),code(l J 0),code(2,0) ) 0,code(l,0),code(0 J 0) 3 l,code(2 > l),0 
code(5,0),code(8,l),l 35 

For every DCT difference, there is a sign indication 
associated with it. There are four possible cases. In the above 
coding 0, 1, 2, and 3 are used to denote the four cases. If the 
sign indication is 2 or 3, the sign bit does not have to be 40 
coded because it is either associated with a zero difference 
or available from the corresponding base layer data. If the 
sign indication is 0 or 1 a sign bit code is required once per 
difference value, i.e. not every bit-plane of the difference 
value. Therefore, a sign bit is put immediately after the most 45 
significant bit of the difference. 
Optimal Reconstruction of the DCT Coefficients 

In an alternative embodiment of the present invention, 
even though N enhancement bitstream layers or planes may 
have been generated, only M, wherein M^N enhancement 50 
layer bits are available for reconstruction of the DCT coef- 
ficients due to the channel capacity, and other constraints 
such as congestion among others, the decoder 80 of FIG. 1 
may receive no enhancement difference or only a partial 
enhancement difference. In such a case, the optimal recon- 55 
struction of the DCT coefficients is capable of proceeding 
along the following method: 

If decoded difference=0, the reconstruction point is the 
same as that in base layer, otherwise, the reconstructed 
difference -decoded difference+Vi*(l«decoded_bit_ 60 
plane) and the reconstruction point=reference point+ 
reconstructed difference *Q_enh+Q_enh/2. 

In the present embodiment, referring to FIGS. 3C and 3D, 
the optimal reconstruction point is not the lower boundary of 
a quantization bin. The above method specifies how to 
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obtain the optimal reconstruction point in cases where the 
difference is quantized and received partially, i.e. not all of 
the enhancement layers generated are either transmitted or 
received as shown in FIG. 1. wherein M§N. 
What is claimed is: 

1. A video encoding method for selectively adapting a 
video input for transmission over a specific channel of a 
network, wherein said network includes a plurality of 
channels, said channels having a plurality of different 
bandwidths, the method comprising the steps of: 

determining number N of enhancement layer bitstreams 
capable of being adapted to said particular bandwidth 
of said specific transmission channel of said network; 

encoding a base layer bitstream from said video input; 

encoding N number of enhancement layer bitstreams from 
said video input based on the base layer bitstream, 
wherein the N enhancement layer bitstreams comple- 
ment the base layer bitstream; and 

providing the base layer bitstream and N enhancement 
layer bitstreams to said network. 

2. The video encoding method according to claim 1, 
wherein the determining step includes negotiating with 
intermediate devices on said network. 

3. The video encoding method according to claim 2, 
wherein negotiating includes determining destination 
resources. 

4. The video encoding method according to claim 1, 
wherein the step of encoding the base layer bitstreams is 
performed by a MPEG-1 encoding method. 

5. The video encoding method according to claim 1, 
wherein the step of encoding the base layer bitstreams is 
performed by a MEPG-2 encoding method. 

6. The video encoding method according to claim 1, 
wherein the step of encoding the base layer bitstreams is 
performed by a MPEG-4 encoding method. 

7. The video encoding method according to claim 1, 
wherein the step of encoding the base layer bitstream is 
performed by a Discrete Cosine Transform (DCI^ method. 

8. The video encoding method according to claim 7, 
wherein after encoding the base layer bitstreams by a 
Discrete Cosine Transform (DCT) method a DCT coefficient 
is quantized. 

9. The video encoding method according to claim 1, 
wherein the enhancement layer bitstreams are based on the 
difference of an original base layer DCT coefficient and a 
corresponding base layer quantized DCT coefficient. 

10. The video encoding method according to claim 1, 
wherein the base layer bitstream and the N enhancement 
layer provide to the network are multiplexed. 

11. The method of claim 10 further comprising the steps 
of: 

demultiplexing said base layer bitstream and at least one 
of said plurality of enhancement layer bitstreams 
received from said network; 

decoding said base layer bitstream; 

decoding at least one of said plurality of enhancement 
layer bitstreams based on generated base layer 
bitstream, wherein the at least one of the plurality of 
enhancement layer bitstreams enhances the base layer 
bitstream; and 

reconstructing a video output. 

* * * * * 
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